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Oh' Abstract. In Online Sum-Radii Clustering, n demand points arrive online and must be irrevocably assigned 

to a cluster upon arrival. The cost of each cluster is the sum of a fixed opening cost and its radius, and the 
objective is to minimize the total cost of the clusters opened by the algorithm. We show that the deterministic 
' competitive ratio of Online Sum-Radii Clustering for metric spaces other than the line is 6*(log n), where the 

CNj . upper bound follows from a primal-dual algorithm and holds for general metric spaces, and the lower bound 

is valid for ternary Hierarchically Well-Separated Trees (HSTs) and for the Euclidean plane. Combined with 
the results of (Csirik et al., MFCS 2010), this result demonstrates that the deterministic competitive ratio of 
Online Sum-Radii Clustering changes abruptly, from constant to logarithmic, when we move from the line 
to the plane. We also show that Online Sum-Radii Clustering in metric spaces induced by HSTs is closely 
C/3 , related to the Parking Permit problem introduced by (Meyerson, FOCS 2005). Exploiting the relation to 

O ■ Parking Permit, we obtain a lower bound of i? (log log n) on the randomized competitive ratio of Online 

Sum-Radii Clustering in tree metrics, and a randomized 0{2 d \fd log log n) -competitive algorithm for the 
d-dimensional Euclidean space. Moreover, we present a simple and memoryless randomized O(logn)- 
■ competitive algorithm and a deterministic 0(log log n) -competitive fractional algorithm, which both work 

' for general metric spaces. 
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1 Introduction 

In clustering problems, we seek a partitioning of n demand points into k groups, or clusters, so that a 
given objective function, that depends on the distance between points in the same cluster, is minimized. 
Typical examples are the /c-Center problem, where we minimize the maximum cluster diameter, the 
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Sum-fc-Radii problem, where we minimize the sum of cluster radii, and the /c-Median problem, where 
we minimize the total distance of points to the nearest cluster center. These are fundamental problems 
in Computer Science, with many important applications, and have been extensively studied from an 
algorithmic viewpoint (see e.g. |[20l and the references therein). 

In this work, we study an online clustering problem closely related to Sum-fc-Radii. In the on- 
line setting, the demand points arrive one-by-one and must be irrevocably assigned to a cluster upon 
arrival. We require that once formed, clusters cannot be merged, split, or have their center or radius 
changed. The goal is to open a few clusters with a small sum of radii. However, instead of requiring 
that at most k clusters open, which would lead to an unbounded competitive ratio, we follow M7I8I and 
consider a Facility-Location-like relaxation of Sum-fc-Radii, called Sum-Radii Clustering. In Sum- 
Radii Clustering, the cost of each cluster is the sum of a fixed opening cost and its radius, and we seek 
to minimize the total cost of the clusters opened by the algorithm. 

In addition to applications in clustering and data analysis, Sum-Radii Clustering has applications 
in the location of wireless base stations, such as sensors M8I9I and antennas M3I17II . In such problems, 
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we have to place some wireless base stations and setup their communication range so that some 
communication demands are satisfied and the total setup and operational cost is minimized. A standard 
assumption is that the setup cost is proportional to the number of stations installed, and the operational 
cost for each station is proportional to its range (or a low-degree polynomial of it). 

Related Work. In the offline setting, the problem of Sum-fc-Radii and the closely related problem 
of Sum-/c-Diameterg^] have been thoroughly studied from the viewpoint of exact and approximation 
algorithms. Sum-/c-Radii is NP-hard even in metric spaces of constant doubling dimension |[T6l . Gib- 
son et al. lfT5l proved that Sum-/c-Radii in Euclidean spaces of constant dimension is polynomially 
solvable, and presented an 0(n logZi logn ) -time exact algorithm for Sum-A;-Radii in general metric 
spaces, where A is the diameter lfl6ll . As for approximation algorithms, Doddi et al. ifTOll proved that 
it is NP-hard to approximate Sum-/c-Diameters in general metric spaces within a factor less than 2, 
and gave a bicriteria algorithm that achieves a logarithmic approximation using O(k) clusters. Subse- 
quently, Charikar and Panigraphy Q presented a primal-dual (3.504+e)-approximation algorithm for 
Sum-A;-Radii in general metric spaces, which uses as a building block a primal-dual 3-approximation 
algorithm for Sum-Radii Clustering. Bilo et al. Q considered a generalization of Sum-A;-Radii, where 
the cost is the sum of the a-th power of the clusters radii, for a > 1, and presented a polynomial-time 
approximation scheme for Euclidean spaces of constant dimension. 

Charikar and Panigraphy also considered the incremental version of Sum-/c-Radii. Similarly 
to the online setting, an incremental algorithm receives the demands one-by-one and assigns them to a 
cluster upon arrival. However, an incremental algorithm can also merge any of its clusters at any time. 
They presented an 0(l)-competitive incremental algorithm for Sum-/c-Radii that uses O(k) clusters. 

In the online setting, where cluster reconfiguration is not allowed, the Unit Covering and the Unit 
Clustering problems have received most of the attention. In both problems, the demand points arrive 
one-by-one and must be irrevocably assigned to unit-radius balls upon arrival, so that the number 
of balls used is minimized. The difference is that in Unit Covering, the center of each ball is fixed 
when the ball is first used, while in Unit Clustering, there is no fixed center and a ball may shift and 
cover more demands. Charikar et al. |6J proved an upper bound of 0{2 d d\og d) and a lower bound of 
/2(log dj log log log d) on the deterministic competitive ratio of Unit Covering in d dimensions. The 
results of O imply a competitive ratio of 2 and 4 for Unit Covering on the line and the plane, respec- 
tively. The Unit Clustering problem was introduced by Chan and Zarrabi-Zadeh |0. The deterministic 
competitive ratio of Unit Clustering on the line is at most 5/3 ifTTTl and no less than 8/5 fPHl . Unit 
Clustering has also been studied in d-dimensions with respect to the norm, where the competitive 
ratio is at most |2 d , for any d, and no less than 13/6, for d > 2 IfTTTl . 

Departing from this line of work, Csirik at el. [fSj studied the problem of online clustering to 
minimize the sum of the setup costs and the diameters of the clusters (CSDF). Motivated by the 
difference between Unit Covering and Unit Clustering, they considered three models, the strict, the 
intermediate, and the flexible one, depending on whether the center and the radius of a new cluster are 
fixed at its opening time. Csirik at el. only studied CSDF on the line and proved that its deterministic 
competitive ratio is 1 + v2 for the strict and the intermediate model and (1 + v5)/2 for the flexible 
model. Recently, Diveki and Imreh J9] studied online clustering in two dimensions to minimize the 
sum of the setup costs and the area of the clusters. They proved that the competitive ratio of this 
problem lies in (2.22, 9] for the strict model and in (1.56, 7] for the flexible model. 

Contribution. Following JH, it is natural and interesting to study the online clustering problem of 
CSDF in metric spaces more general than the line. In fact, we consider the closely related problem of 

3 These problems are closely related in the sense that a c-approximation/competitive algorithm for Sum-fc-Radii implies a 
2c-approximation/competitive algorithm for Sum-fc-Diameters, and vice versa. 
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Online Sum-Radii Clustering (OnlSumRad), and give upper and lower bounds on its deterministic and 
randomized competitive ratio for general metric spaces and for the d-dimensional Euclidean space. 

We restrict our attention to the strict model of (H, where the center and the radius of each new 
cluster are fixed at its opening time. To justify our choice, we show that a c-competitive algorithm for 
the strict model implies an O(c) -competitive algorithm for the intermediate or the flexible model. 

We show that the deterministic competitive ratio of OnlSumRad for metric spaces other than the 
line is 0(logn), where the upper bound follows from a primal-dual algorithm and holds for general 
metric spaces, and the lower bound is valid for ternary Hierarchically Well-Separated Trees (HSTs) 
and for the Euclidean plane. This result is particularly interesting because it demonstrates that the 
deterministic competitive ratio of OnlSumRad (and of CSDF) changes abruptly, from constant to 
logarithmic, when we move from the line to the plane. We note that this does not happen when the 
cost of each cluster is proportional to its area (3. 

Another interesting finding is that OnlSumRad in metric spaces induced by HTSs is closely re- 
lated to the Parking Permit problem introduced by Meyerson lfl9l . In Parking Permit, we cover a set of 
driving days by choosing among K permit types, each with a given cost and duration. The permit costs 
are concave, in the sense that the cost per day decreases with the duration. The algorithm is informed 
of the driving days in an online fashion, and irrevocably decides on the permits to purchase, so that 
all driving days are covered by a permit and the total cost is minimized. Meyerson flT9l proved that 
the competitive ratio of Parking Permit is 0{K) for deterministic and ©(log K) for randomized algo- 
rithms. We prove that OnlSumRad in HSTs is a generalization of Parking Permit. Combined with the 
randomized lower bound of lil9l . this implies a lower bound of J7(log log n) on the randomized com- 
petitive ratio of OnlSumRad. Moreover, we show that, under some mild assumptions, a c-competitive 
algorithm for Parking Permit with K types implies a c-competitive algorithm for OnlSumRad in HSTs 
with K levels. 

On the positive side, we give a reduction from OnlSumRad in the d-dimensional Euclidean space 
to OnlSumRad in HSTs with K = O(logra) levels which increases the competitive ratio by a fac- 
tor of 0(2 d yd). Using the competitive-ratio-preserving reduction from OnlSumRad in HSTs with 
K levels to ParkPermit with K types and the randomized algorithm of |fT9ll . we obtain a random- 
ized 0(2 d \fd log log n) -competitive algorithm for OnlSumRad in the d-dimensional Euclidean space. 
Moreover, we present a simple and memoryless randomized algorithm that achieves a competitive ra- 
tio of 0(log n) for general metric spaces. 

We conclude with a deterministic 0(log log n) -competitive online algorithm for the fractional 
version of OnlSumRad in general metric spaces. In the fractional version, we maintain fractions of 
open clusters that cumulatively cover all demands. The fractional algorithm is based on the primal- 
dual approach of 1121 1 H . and generalizes the fractional algorithm of |[T9l for Parking Permit. We leave 
as an open problem the existence of a randomized rounding procedure that converts the fractional 
solution to an (integral) clustering of cost within a constant factor of the original cost. Such a rounding 
procedure would imply an (asymptotically optimal) randomized O (log log n) -competitive algorithm 
for OnlSumRad in general metric spaces. 

Other Related Work. At the conceptual level, OnlSumRad is related to the problem of Online Facility 
Location (see e.g. II18I14I13II ). However, the two problems exhibit a quite different behavior with 
respect to their competitive ratio, since the competitive ratio of Online Facility Location is &{ \ogi^ n )' 
even on the line, for both deterministic and randomized algorithms lfl4ll . 
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2 Notation, Problem Definition, and Preliminaries 

Notation. We consider a metric space (M, d), where M is the set of points and d : M x M \— > N 
is the distance function, which is non-negative, symmetric and satisfies the triangle inequality. For a 
set of points M' C M, we let diam(M') = max U)Ve M/{d(u, v)} be the diameter and rad(M') = 
min ue M' raax v£ ]\ji{d(u, v)} be the radius of M'. In a tree metric, the points correspond to the nodes 
of an edge-weighted tree and the distances are given by the tree's shortest path metric. For some 
a > 1, a Hierarchically a-Well-Separated Tree (a-HST) is a complete rooted tree with lengths on its 
edges such that: (i) the distance of each leaf to its parent is 1, and (ii) on every path from a leaf to 
the root, the edge length increases by a factor of a on every level. Thus, the distance of any node 
at level k to its children is a k " 1 and the distance of Vk to the nearest leaf is (a k — l)/(a — 1). We 
slightly abuse the notation and identify a tree (or an HST) with the metric space induced by it. 

A cluster C(p, r) = {v : d(p, v) < r} is determined by its center p and its radius r, and consists 
of all points within a distance at most r to p. The cost of a cluster C(p, r) is the sum of its opening 
cost / and its radius r. 

Sum-Radii Clustering. In the offline version of Sum-Radii Clustering, the input consists of a metric 
space (M, d), a cluster opening cost /, and a set D = {u\, . . . , u n } of demand points in M. The 
goal is to find a collection of clusters C(p±, r±), . . . ,C(pk,rk) that cover all demand points in D and 
minimize the total cost, which is Yli=i(f + r «)- 

Online Sum-Radii Clustering. In the online setting, the demand points arrive one-by-one, in an 
online fashion, and must be irrevocably assigned to an open cluster upon arrival. Formally, the input 
to Online Sum-Radii Clustering (OnlSumRad) consists of the cluster opening cost / and a sequence 
ux, . . . , u n of (not necessarily distinct) demand points in an underlying metric space (M, d). The goal 
is to maintain a set of clusters of minimum total cost that cover all demand points revealed so far. 

In this work, we focus on the so-called Fixed-Cluster version of OnlSumRad, where the center and 
the radius of each new cluster are irrevocably fixed when the cluster opens. Thus, the online algorithm 
maintains a collection of clusters, which is initially empty. Upon arrival of a new demand Uj, if Uj is 
not covered by an open cluster, the algorithm opens a new cluster C(p, r) that includes Uj, and assigns 
Uj to it. The algorithm incurs an irrevocable cost of / + r for the new cluster C(p, r). 

Competitive Ratio. We evaluate the performance of the algorithms presented in this work using com- 
petitive analysis (see e.g. H). A (randomized) algorithm is c-competitive if for any sequence of de- 
mand points, its (expected) cost is at most c times the cost of the optimal solution for the corresponding 
offline Sum-Radii instance. We highlight that the (expected) cost of the algorithm is compared against 
the cost of an optimal offline algorithm that is aware of the entire demand sequence in advance and 
has no computational restrictions whatsoever. 

Simplified Optimal. The following proposition, whose proof can be found in the Appendix, Sec- 
tion |AT] simplifies the structure of the optimal solution in the competitive analysis of our algorithms. 

Proposition 1. Let S be a feasible solution of an instance X of OnlSumRad. Then, there is a feasible 
solution S' of I with a cost of at most twice the cost of S, where each cluster has a radius of2 k f, for 
some integer k > 0. 

Other Versions of Online Sum-Radii Clustering. For completeness, we discuss two seemingly less 
restricted versions of OnlSumRad, corresponding to the intermediate and the flexible model in (H. 
In both versions, the demands are irrevocably assigned to a cluster upon arrival. In the Fixed-Radius 
version, only the radius of a new cluster is fixed when the cluster opens. The algorithms incurs an 
irrevocable cost of f+r for each new cluster C of radius r. Then, new demands can be assigned to C, 



4 



provided that rad(C) < r. In the Flexible-Cluster version, a cluster C is a set of demands with neither 
a fixed center nor a fixed radius. The algorithm's cost for each cluster C is / + rad(C), where rad(C) 
may increase as new demands are added to C. Clearly, the Fixed-Cluster version is a restriction of the 
Fixed-Radius version, which, in turn, is a restriction of the Flexible-Cluster version. The following 
proposition, whose proof can be found in the Appendix, Section IA.2I shows that from the viewpoint 
of competitive analysis, all the three versions are essentially equivalent. 

Proposition 2. A c-competitive algorithm for the Fixed-Radius (resp. Flexible-Cluster) version im- 
plies a 2c-competitive (resp. lOc-competitive) algorithm for the Fixed-Cluster version. 

Parking Permit. In Parking Permit (ParkPermit), we are given a schedule of days, some of which 
are marked as driving days, and K types of permits, where a permit of each type k, k = 1, . . . , K, 
has cost Cfc and duration d\~. The goal is to purchase a set of permits of minimum total cost that cover 
all driving days. In the online setting, the driving days are presented one-by-one, and the algorithm 
irrevocably decides on the permits to purchase based on the driving days revealed so far. 

Meyerson |[T9l Theorem 2.2] observed that by losing a constant factor in the competitive ratio, we 
can focus on the interval version of the problem, where each permit is available over specific time 
intervals (e.g. a weakly permit is valid from Monday to Sunday). Moreover, every day is covered by 
a single permit of each type k, and each permit of type k > 2 has permits of type k — 1 

embedded in it (see e.g. Fig.Q]for the structure of an interval instance). 

An interesting feature of Meyerson's algorithms is that they are time-sequence-independent, in 
the sense that they apply, with the same competitive ratio, even if the order in which the driving days 
are revealed is different from their time order (e.g. the adversary may mark Sept. 23 as a driving 
day, before marking Aug. 6 as a driving day). In fact, Meyerson's algorithms exploit the hierarchical 
decomposition of the time axis into intervals, imposed by the permit structure, and not the relative 
time order between different days / subintervals. 

3 Online Sum-Radii Clustering and Parking Permit 

In this section, we establish a correspondence between OnlSumRad and ParkPermit. Specifically, we 
show that OnlSumRad in tree metrics and the interval version of ParkPermit are essentially equiva- 
lent problems. A few of our results either are directly based on this correspondence or exploit this 
correspondence so that they draw ideas from ParkPermit. We start with the following theorem, which 
shows that OnlSumRad in tree metrics is a generalization of the interval version of ParkPermit. 

Theorem 1. A c-competitive algorithm for Online Sum-Radii Clustering in HSTs with K + 1 levels 
implies a c-competitive algorithm for the interval version of Parking Permit with K permit types. 

Proof. Given an instance X of the interval version of ParkPermit with K permit types, we construct 
an instance X' of OnlSumRad in an HST with K + 1 levels such that any feasible solution of X is 
mapped, in an online fashion, to a feasible solution of X' of equal cost, and vice versa. 

Let X be an instance of the interval version of ParkPermit with K permit types of costs c±,...,ck 
and durations di, . . . , dx- Without loss of generality, we assume that c\ = 1 and that all days are 
covered by the permit of type K. Given the costs and the durations of the permits, we construct a tree 
T with appropriate edge lengths, which gives the metric space for X' . The construction exploits the 
tree-like structure of the permits in the interval version of the problem (see also Fig. \Q. Specifically, 
the tree T has K + 1 levels, where the leaves correspond to the days of X's schedule, and each node 
at level k, 1 < k < K, corresponds to a permit of type k. 
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Fig. 1. An example of the reduction of TheoremQ] On the left, there is an instance of the interval version of ParkPermit. A 
feasible solution consists of the permits in grey. On the right, we depict the instance of OnlSumRad constructed in the proof 
of Theorem[T] The ParkPermit solution on left is mapped to a solution with clusters centered at each black node. The radius 
of each cluster is equal to the distance of its center to the nearest leaf. 

Formally, the tree T has a leaf, at level 0, for each day in the schedule of X. For each interval D\ 
of d\ days covered by a permit of type 1, there is a level- 1 node v i in T whose children are the d\ 
leaves corresponding to the days in D±. The distance of each level- 1 node to its children is c\ — 1 = 0. 
Hence, opening a cluster C{v\,c\ — 1) covers all leaves corresponding to the days in the interval D\. 
Similarly, for each interval Dk of dk days covered by a permit of type k, 2 < k < K, there is a node 
Vf, at level k in T whose children are the d^/d^-i nodes at level k — 1 corresponding to the permits of 
type k — 1 embedded within the particular permit of type k. The distance of each level-A; node to its 
children is Ck — Ck-i- Therefore, opening a cluster C(vk, Ck — 1) covers all all leaves corresponding 
to the days in the interval Dk- The cluster opening cost is / = 1. For each day t marked as a driving 
day in X, there is a demand located at the leaf of T corresponding to t in X'. 

Based on the correspondence between a type- A; permit and a cluster C(vk, Ck — 1) rooted at a 
level-/c node uj., we show in the Appendix, Section IA31 that any feasible solution of X is mapped, in 
an online fashion, to a feasible solution of X' of equal cost, and vice versa. □ 

In the proof of Theorem [Q if the ParkPermit instance has d\ = 1 and c& = 2 k , for all k = 
1 , . . . , K , the tree T is essentially a 2-HST with K levels where all nodes at the same level k have 
dk/dk-i children. Thus, combined with Theorem [T] the following lemma shows that OnlSumRad in 
such tree metrics is essentially equivalent to the interval version of ParkPermit. The proof of LemmaQ] 
applies essentially the reverse reduction of Theorem [T] and is deferred to the Appendix, Section [A4l 

Lemma 1. A c-competitive time-sequence-independent algorithm for the interval version of ParkPermit 
with K permits implies a c-competitive algorithm for OnlSumRad in HSTs with K levels, where all 
nodes at the same level have the same number of children and all demands are located at the leaves. 

4 Lower Bounds on the Competitive Ratio of Online Sum-Radii Clustering 

By Theorem [Q Online Sum-Radii Clustering in trees with K + 1 levels is a generalization of Park- 
ing Permit with K permit types. Therefore, the results of |[T9l imply a lower bound of Q(K) (resp. 
i?(log K)) on the deterministic (resp. randomized) competitive ratio of OnlSumRad in trees with K 
levels. However, a lower bound on the competitive ratio of OnlSumRad would rather be expressed in 
terms of the number of demands n, because not only there is no simple and natural way of defining the 
number of "levels" of a general metric space, but also for online clustering problems, the competitive 
ratio, if not constant, is typically stated as a function of n. 

Going through the details of the proofs of Theorem Q] and of [19, Theorems 3.2 and 4.6], we can 
translate the lower bounds on the competitive ratio of ParkPermit, expressed as a function of K, into 
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equivalent lower lower bounds for OnlSumRad, expressed as a function of n. In fact, the proofs of 
|fl9l Theorems 3.2 and 4.6] require that the ratio d^/dk-i of the number of days covered by permits 
of type k and A; — 1 is 2K. Thus, in the proof of Theorem [T] the tree T has (2K) K leaves, and the 
number of demands n is at most (2K) . Combining this with the lower bound of J?(logif) on the 
randomized competitive ratio of ParkPermit [19, Theorem 4.6], we obtain the following corollary: 

Corollary 1. The competitive ratio of any randomized online algorithm for Online Sum-Radii Clus- 
tering in tree metrics is I?(log log n), where n is the number of demands. 

A Stronger Lower Bound on the Deterministic Competitive Ratio. This approach gives a lower 
bound of ^ dogiogn ) on tne deterministic competitive ratio of OnlSumRad. Using a ternary HST 
instead, we next obtain a stronger lower bound. 

Theorem 2. The competitive ratio of any deterministic online algorithm for Online Sum-Radii Clus- 
tering in tree metrics is i?(log n), where n is the number of demands. 

Proof. For simplicity, let us assume that n is an integral power of 3. For some constant a € [2, 3), we 
consider an a-HST T of height K = log 3 n whose non-leaf nodes have 3 children each. The cluster 
opening cost is / = 1. Let A be any fixed deterministic algorithm. We consider a sequence of demands 
located at the leaves of T. More precisely, starting from the leftmost leaf and advancing towards the 
rightmost leaf, the next demand point in the sequence is located at the next leaf not covered by an 
open cluster of A. Since T has n leaves, A may cover all leaves of T before the arrival of n demand 
points. Then, the demand sequence is completed in an arbitrary way that does not increase the optimal 
cost. We let Copt be the optimal cost, and let Ca be the cost of A on this demand sequence. 

We let Cfe = l+X^=o ai denote the cost of a cluster centered at a level-/c node with radius equal 
to the distance of Vk to the nearest leaf. We observe that for any k > 1 and any a > 2, Ck < ack-i- 
We classify the clusters opened by A according to their cost. Specifically, we let L^, < k < K, be 
the set of A's clusters with cost in [cfc, Cfc+i), and let = \Lk\ be the number of such clusters. The 
key property is that a cluster in can cover the demands of a subtree rooted at level at most k, but 
not higher. Therefore, we can assume that all A's clusters in are centered at a level-/c node and 
have cost equal to c/., and obtain a lower bound of Ca > J2k=o on the algorithm's cost. 

To derive an upper bound on the optimal cost in terms of Ca, we distinguish between good and 
bad active subtrees, depending on the size of the largest radius cluster with which A covers the demand 
points in them. Formally, a subtree rooted at level k is active if there is a demand point located at 
some leaf of it. For an active subtree T^, we let C|? ax denote the largest radius cluster opened by A 
when a new demand point in Tfc arrives. Let j, < j < K, be such that C^ ax G Lj. Namely, C^ ax 
is centered at a level-j node Vj and covers the entire subtree rooted at Vj. If j > k, i.e. if C|? ax covers 
Tfc entirely, we say that is a good (active) subtree (for the algorithm A). If j < k, i.e. if C|? ax does 
not cover entirely, we say that is a bad (active) subtree (for A). 

For each k = 0, . . . , K, we let (resp. bk) denote the number of good (resp. bad) active subtrees 
rooted at level k. To bound from above, we observe that the last demand point of each good active 
subtree rooted at level k is covered by a new cluster of A rooted at a level j > k. Therefore, the number 
of good active subtrees rooted at level k is at most the number of clusters in UjL k Lj. Formally, for 

each level k > 0, < J2f=k^j- ^° Doun cl bk from above, we first observe that each active leaf / 
demand point is a good active level-0 subtree, and thus bo = 0. For each level k > 1, we observe 
that if Tfc is a bad subtree, then by the definition of the demand sequence, the 3 subtrees rooted at the 
children of Tj.'s root are all active. Moreover, each of these subtrees is either a bad subtree rooted at 
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level k — 1, in which case it is counted in or a good subtree covered by a cluster in in 
which case it is counted in ik-i- Therefore, for each level k > 1, 36fc < bk-i + ik—\- 

Using these bounds on and b^, we can bound from above the optimal cost in terms of Ca- To 
this end, the crucial observation is that we can obtain a feasible solution by opening a cluster of cost 
centered at the root of every active subtree rooted at level k. Since the number of active subtrees rooted 
at level k is bk+g k , we obtain that for every k > 0,Copt < Ck(bk+9k)- Using the upper bound on 
and summing up for k = 0, . . . , K, we have that (K + 1)Copt < Ylk=o Ckbk + Y.k=o Ck Ylf=k ?j ■ 

Using that < a k and that ct < ack-i, which hold for all a > 2, we bound the second term by: 

K K K k K k K K 

^2 c k ^2 tj = ^2 ^ c j - ^ ai - ^ k ° k + i - a ^2 ^ kCk - oCa 

k=0 j=k k=0 j=0 k=0 j=0 k=0 k=0 

To bound the first term, we use that for every level k > 1, 36^ < bf c _i + lk-i and c& < ac^-i- 
Therefore, (3/a)&fcC& < (bk-i + £k-i)ck-i- Summing up for k = 1,...,K, we have that: 

^ K K K 

- ^ b k c k < ^2 bk-iCk-i + ^2 4-iCfc-i 

k=l k=l k=l 

Using that &o = and that a < 3, we obtain that: 

^ K K-l K-l K K 

- ^2 bk ° k - "12 hk ° k + ^2 tkCk - ^2 bk ° k +Ca ^ ^2 bkCk - 3^7^ Ca 

k=0 k=0 k=0 k=0 k=0 

Putting everything together, we conclude that for any a £ [2, 3), (K + 1)Copt < (a + ^^)Ca- 
Since K = log 3 n, this implies the theorem. □ 

A Lower Bound for Deterministic OnlSumRad on the Plane. Motivated by the fact that the de- 
terministic competitive ratio of OnlSumRad on the line is constant [8], we study OnlSumRad in Eu- 
clidean spaces of small constant dimension. The following theorem uses a constant-distortion planar 
embedding of the ternary a-HST employed in the proof of Theorem |2j and establishes a lower bound 
of I?(log n) on the deterministic competitive ratio of OnlSumRad on the Euclidean plane. 

Theorem 3. The competitive ratio of any deterministic online algorithm for Online Sum-Radii Clus- 
tering on the Euclidean plane is fi(log n), where n is the number of demands. 

Proof sketch. Using a planar embedding of a ternary a-HST T with distortion D a < y/2a/(a — 2), 
we show that a c-competitive algorithm for OnlSumRad on the plane implies a 2cD a -competitive 
algorithm for HSTs. The details can be found in the Appendix, Section !a31 □ 

5 An Optimal Primal-Dual Algorithm 

In this section, we present a deterministic primal-dual algorithm for OnlSumRad in a general metric 
space (M, d). Throughout this section, we assume that the optimal solution only consists of clusters 
with radius 2 k f, where A; is a non-negative integer (see also Proposition [TJ). For simplicity, we let 
= 2 k f, if k > 0, and r& = 0, if k = — 1. Let N = N U { — 1}. Then, the following are a Linear 
Programming relaxation of OnlSumRad and its dual: 
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n 




max > a,- 



aj < f + r k V 0, fc) 



(z,k):d(uj,z)<rk 



j:d(uj,z)<r k 

aj > 



x 2fc > \/(z,k) 



In the primal program, there is a variable x zk for each point z of the metric space and each k G TV 
that indicates the extent to which cluster C (z, r^) is open. The constraints require that each demand Uj 
is fractionally covered. If we require that x z k E {0, 1} for all z, k, we obtain an Integer Programming 
formulation of OnlSumRad. In the dual program, there is a variable aj for each demand Uj, and the 
constraints require that no potential cluster is "overpaid". 

The primal-dual algorithm for OnlSumRad, or PD-SumRad in short, maintains a collection of 
clusters that cover all the demands processed so far. When a new demand Uj, j = 1, . . . , n, arrives, if 
uj is covered by an already open cluster C, PD-SumRad assigns uj to C and sets Uj's dual variable aj 
to 0. Otherwise, PD-SumRad sets aj to /. This makes the dual constraint corresponding to (uj, —I) 
and possibly some other dual constraints tight. PD-SumRad finds the maximum k G N such that 
for some point z G M, the dual constraint corresponding to (z, k) becomes tight due to aj. Then, 
PD-SumRad opens a new cluster C(z, 3rfc) and assigns Uj to it. The main result of this section is that: 

Theorem 4. The competitive ratio of PD-SumRad is <9(log n). 

The lower bound on the competitive ratio of PD-SumRad follows from Theorem [2] The proof of 
the upper bound consists of a pair of lemmas. In the Appendix, Section lAT6l we show that the dual 
solution maintained by PD-SumRad is feasible. Thus the optimal cost for any demand sequence is at 
least the value of the dual solution maintained by PD-SumRad. The following lemma shows that the 
total cost of PD-SumRad is at most O(logra) times the value of its dual solution. 

Lemma 2. The cost of PD-SumRad is at most 3(2 + log 2 n) Y^j=i a j ■ 

Proof. We call a cluster C(z, rjS) tight if the dual constraint corresponding to (z, k) is satisfied with 
equality. We observe that for any integer k > log 2 n and for all points z, C(z, k) cannot become tight, 
because the lefthand-side of any dual constraint is at most nf. Therefore, we can restrict our attention 
to at most 2 + log 2 n values of k. 

Next, we show that for all k = — 1, 0, ... , [log 2 n \ . eac h demand Uj with aj > contributes to the 
opening cost of at most one cluster with radius 3r^. Namely, PD-SumRad opens at most one cluster 
C(z, 3rfc) for which Uj belongs to the tight cluster C(z, r&). We prove this claim by contradiction. 
Let us assume that for some value of k, PD-SumRad opens two clusters C\ = C(z±, 3r&) and C 2 = 
C(z 2 , 3rfc) for which there is a demand Uj with aj > that belongs to both C{z\, r^) and C(z 2 , r^). 
Since PD-SumRad opens at most one new cluster when a new demand is processed, one of the clusters 
C\, C 2 opens before the other. So, let us assume that C\ opens before C 2 . This means that PD-SumRad 
opened C\ in response to a demand Uji, with j' < j, that was uncovered at its arrival time and made 
C(z\, rfc) tight. Then, any subsequent demand u G C(z 2 , r^) is covered by C\, because: 



The second inequality above holds because both u and Uj belong to C(z 2 ,rfc) and Uj also belongs 
to C{z\, rfc). Therefore, after C\ opens, there are no uncovered demands in C(z 2 , r^) that can force 
PD-SumRad to open C 2 , a contradiction. 



d(u, z\) < d(u, Uj) + d(uj, z\) < 2r k + = 3r k 
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To conclude the proof of the lemma, we observe that when PD-SumRad opens a new cluster 
C(z, 3rfe), the cluster C(z, r^) is tight. Hence, the total cost of C(z, Sr^) is at most 3 J2 U ec(z r k ) a r 
Therefore, the total cost of PD-SumRad is at most: 

n 

3aj =3^2 a j \{( z ' *0 : C( z ' 3r fe) opens and Uj G C(z, r fc )}| 

(z,fc):C(^,3rfc) opens uj&C(z,rf c ) j=l 

ii 

< 3(2 + log 2 n) a, 

The inequality above holds because for each k = —1,0, ... , |log 2 n\ and each demand Uj with 
aj > 0, there is at most one pair (z, k) such that C(z, 3r&) opens and Uj G C(z, r^). □ 



6 Randomized and Fractional Algorithms 

Throughout this section, we assume, essentially without loss of generality, that the optimal solution 
only consists of clusters of radius 2 k f, where is a non-negative integer. 

A Simple Randomized Algorithm for General Metrics. We start with a simple randomized algo- 
rithm, or Simple-SumRad in short, with a logarithmic competitive ratio. Simple-SumRad is a mem- 
oryless algorithm, in the sense that it keeps in memory only the solution maintained by it, namely 
the centers and the radii of its clusters. For simplicity, we assume that n is an integral power of 2 
and known to the algorithm in advance. This assumption can be easily removed by standard tech- 
niques (see also Section pV9l in the Appendix). When a new demand Uj arrives, if uj is covered by 
an already open cluster C, Simple-SumRad assigns Uj to C. Otherwise, for each k = 0, . . . , log 2 n, 
Simple-SumRad opens a new cluster C(uj, 2 k f) with probability 2~ k , and assigns uj to the cluster 
C(uj, f), which opens with probability 1. In the Appendix, Section IA771 we show that: 

Lemma 3. Simple-SumRad achieves a competitive ratio of at most 2(4 + log 2 n). 

A Randomized Algorithm for OnlSumRad in d Dimensions. Next, we present a reduction of 
OnlSumRad in the <i-dimensional Euclidean space into OnlSumRad in HSTs where each internal 
node has 2 d children. The reduction increases the competitive ratio by a factor of 0(2 d y/d). 

Lemma 4. A c(K)-competitive algorithm for OnlSumRad in HSTs with K levels, where all nodes at 
the same level have the same number of children and all demands are located at the leaves, implies 
a 0(2° ''^/d c{K))- competitive algorithm for OnlSumRad in the d-dimensional Euclidean space, with 
K = O(logra). 

Proof sketch. Given an instance X of OnlSumRad in the d-dimensional Euclidean space, we construct 
an instance X' of OnlSumRad in a 2-HST T with K = 0(log n) levels. The HST T corresponds to a 
hierarchical decomposition V of the d-dimensional space into hypercubes with K + 1 levels, where 
the hypercubes at level k have a side length of 2 k . 

Using that any cluster of radius 2 fc ~ 1 in X intersects at most 2 d level-A; hypercubes of the hierarchi- 
cal decomposition V, we show that any feasible solution of X of cost c is mapped to a feasible solution 
of X' of cost at most 0(2 d c). On the other hand, any cluster Ck of radius 2 k — 1 in T corresponds to a 
single level-A; hypercube of V. Thus, the demands covered by in T can be covered by a single 
cluster of radius V~d ■ 2 k ~ 1 in X. Therefore, given a feasible solution S of X' of cost c, opening the 
clusters in the d-dimensional Euclidean space that correspond to the clusters of S, we obtain a feasible 
solution of X of cost at most \fdc. A detailed proof can be found in the Appendix, Section |AT8l □ 
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The following theorem follows by combining Lemma|4]with LemmaQ] and applying the random- 
ized 0(log K) -competitive algorithm of |fl9l Section 4.1] for the interval version of ParkPermit. We 
note that the proof of Lemma |4] assumes that the number of demands n is known in advance. This 
assumption can be easily removed by standard techniques (see also Section [A9l in the Appendix). 

Theorem 5. There is a randomized 0(2 d \/ r dloglogn)-competitive algorithm for Online Sum-Radii 
Clustering in the d-dimensional Euclidean space. 

A Fractional Algorithm for General Metrics. We conclude with a deterministic O(loglogra)- 
competitive algorithm for the fractional version of OnlSumRad in general metric spaces. The frac- 
tional algorithm is based on the primal-dual approach of 1121 1 H . and is a generalization of the online 
algorithm for the fractional version of ParkPermit presented in |[T9l Section 4.1]. 

A fractional algorithm maintains, in an online fashion, a feasible solution to the Linear Program- 
ming relaxation of OnlSumRad. In the notation of Section [5J for each point-type pair (z, k), the al- 
gorithm maintains a fraction x zk , which denotes the extent to which the cluster C(z, r k ) opens, and 
can only increase as new demands arrive. For each demand uj, the fractions of the clusters cover- 
ing Uj must sum up to at least 1, i.e. J2( z k)-Uj&c(z r k ) x zk > 1- The total cost of the algorithm is 
S(z k) x zk{f + r k)- The competitive ratio is the worst-case ratio of the algorithm's cost to the cost of 
an optimal integral solution on the same demand sequence. 

The Fractional Algorithm. For the fractional algorithm, or Frac-SumRad in short, we assume that the 
number of demands n is an integral power of 2 and known in advance. In the Appendix, Section [A!9l 
we remove these assumptions, by losing a constant factor in the competitive ratio. 

Frac-SumRad considers only K + 1 different types of clusters, where K = log 2 n. For each 
k = 1, . . . , K + 1, we let Ck = f + rk denote the cost of a cluster C(p, r k ) of type k. The algorithm 
considers only the demand locations as potential cluster centers. For convenience, for each demand uj 
and for each k, we let Xj k be the extent to which the cluster C(uj, r k ) is open, with the understanding 
that Xjk = before Uj arrives. Similarly, we let Fjk = fc^eCK r k ) x ik be the extent to which 
demand Uj is covered by clusters of type k, and let Fj = Fjk be the extent to which Uj is covered. 

When a new demand Uj, j = 1, . . . , n, arrives, if Fj > 1, Uj is already covered. Otherwise, while 
Fj < 1, Frac-SumRad performs the following operation: 

1. For every k = 1, . . . K + 1, Xj k <- x jk + Cfc( ^ +1) 

2. For every k = 1, . . . , K + 1 and every demand Ui G C(uj, r k ), Xik <— Xik(l + ^) 

Competitive Analysis. Frac-SumRad maintains a (fractional) feasible solution in an online fashion. 
The proof of the following theorem extents the competitive analysis in |fl9l Section 4.1]. 

Theorem 6. The competitive ratio of Frac-SumRad is 0(log log n). 

Proof. We first consider a single operation performed when a demand Uj arrives, and show that it 
increases the fractional cost by at most 2. Since an operation is performed, Fj < 1. The first step of 
the operation increases the fractional cost by 1/(K + 1) for each cluster type. Hence, the total increase 
in the fractional cost is 1. The second step of the operation increases the fractional cost by: 

K+l 

^ ~] X{k — ^ ^ %ik — ^ ^ Fjk — Fj *~ 1 

(i,k):Ui£C{uj,r k ) (i,k):iij£C(ui,r k ) k=l 

We next show that the number of operations performed by Frac-SumRad for the demands in an 
optimal cluster C(p, r k ) of cost c k is 0(c k+ i log K). We let F p{k+1) = £j:ujeC(p,r fc ) ^(fc+i)- Since 
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for any demand uj £ C(p, r^), C(uj,rk+i) includes the entire C(p, r^), we have that Fj^+i) > 
-Pp(fc+i)- Hence, as soon as F p ( fc+1 ) > 1, every subsequent demand Uj £ C(p,rk) has Fj > 1 at its 
arrival time, and Frac-SumRad does not perform any operations due to demands uj. Consequently, 
the total cost of Frac-SumRad for the demands in C(p, r^j can be bounded by the total increase in the 
fractional cost due to operations caused by demands in C(p, r k ) arriving as long as F p / k+ ^ < 1. 

To bound the number of such operations, we observe that after the first Cf.+i operations caused by 
demands in C(p, r k ), F p ( k+1 \ becomes at least 1/{K + 1), due to the first step of these operations. 
For each subsequent operation caused by a demand in C{p,rk), all fractions with Uj £ 

C(p,r k ), increase by factor of (1 + ). Therefore, F v ( k+ D increases by a factor of (1 H — ). 

After 0(cfe + i log K) such increases, F p (k+i) becomes at least 1, and Frac-SumRad does not perform 
any additional operations due to demands in C(p, r k ) arriving afterwards. 

Therefore, the total fractional cost of Frac-SumRad for the demands in an optimal cluster C(p, r k ) 
of cost c k is 0(cfc + i log K). Then, the theorem follows from c k+ i < 2c k and K = log 2 n. □ 
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A Appendix 

A.l The Proof of Proposition Q] 

For each cluster C(v, r) of S, S' opens a cluster C(v, 2 k f), where k = max{ [log 2 (r//)] , 0}. Clearly, 
C(v, 2 k f) covers all the demand points covered by C(v, r), and thus S' is a feasible solution. As for 
the cost of S', we next show that the cost of C(v,2 k f), that is (1 + 2 k )f, is most twice the cost 
of C(v,r), that is / + r. If r < /, in which case k = 0, the cost of C(v,2 k f) is 2f. If r > /, 
2 k f < 2 1+1 °g2 ('"//)/ = 2r. Therefore, the cost of C(v, 2 k f) is at most / + 2r < 2(f + r). □ 

A.2 The Proof of Proposition |2] 

We first assume a c-competitive algorithm A for the Fixed-Radius version. Based on A, we describe 
an algorithm A' for the Fixed-Cluster version that simulates the behavior of A and has a competi- 
tive ratio of at most 2c. Whenever the algorithm A opens a new cluster C of radius r and covers a new 
demand u, the algorithm A' opens a new cluster C" = C(u, 2r). The cost of C is at most twice the 
cost of C. Moreover, since any subsequent demand u' assigned to C by A is at distance at most 2r to 
u, the new cluster C also covers u' . Therefore, A' covers all demands with a total cost at most twice 
the total cost of A. 

Next, we assume a c-competitive algorithm A for the Flexible-Cluster version. Based on A, 
we describe an algorithm A' for the Fixed-Cluster version that simulates the behavior of A and 
has a competitive ratio of at most 10c. 

Let u be a new demand assigned to a cluster C by the algorithm A. If C is a new cluster that 
includes only u, A' opens a new cluster C(u, f) and assigns u to it. Otherwise, let u be the demand 
in C arrived first. If u is covered by an open cluster of A' centered at u, u is assigned to it. Otherwise, 
A' opens a new cluster C(u, 2 k f), where k = [~log 2 (tf(u, u)/ /)] , and assigns u to it. 

We compare the total cost of the algorithms A and A' for covering the demands in cluster C 
just after the assignment of u. The cost of A is at least / + diam(C)/2. If diam(C) < /, all de- 
mands in C are within a distance of / to C's first demand u, and are assigned to the cluster C(u, f) 
opened by A' when u arrived. Thus, the total cost of A' for the demands in C is at most 2f. If 
diam(C) > /, A' covers the demands in C by opening, in the worst case, a sequence of I + 1 clusters 
C(u, f),C(u, 2/), . . . , C(u, 2 e f), where t = [log 2 (diam(C) //)] . Thus, the total cost of A 1 for the 
demands in C is at most 

e 

^(1 + 2 l )f = (£ + 2 1+e )f < 5 diam(C7) , 

i=0 

where the inequality follows from |~log 2 x] < x and |~log 2 x] < 1 + log 2 x, for all x > 1. □ 
A.3 The Solution Mapping in the Proof of Theorem Q] 

We first describe an online mapping of any feasible solution of I to a feasible solution of I' of equal 
cost. By the construction of T, a permit of type k that covers the driving days in an interval in X 
corresponds to a node Vk at level k of T, in the sense that opening a cluster C(vk, C& — 1) covers all 
demands corresponding to the driving days in D^. Moreover, the cost of C(vD k , c& — 1) is c^, i.e., 
equal to the cost of the corresponding permit. Therefore, opening the clusters corresponding to the 
permits bought by a feasible solution of X gives a feasible solution of X' of equal cost. 
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For the converse mapping, we assume that in any feasible solution of X', all clusters are centered at 
nodes at levels 1, . . . , K of T and that any cluster centered at a level-A; node Vk has radius Ck — 1. This 
assumption is essentially without loss of generality, since any feasible solution without this property 
can be translated into a feasible solution of no greater cost that satisfies this property. Indeed, let 
Cfc = C(vk, r) be any cluster rooted at Vk- If Vk is a leaf, we can root Ck at the ancestor of Vk (recall 
that a leaf and its ancestor are at distance to each other). If r < Ck — 1, does not cover any leaves, 
and can be safely removed from the solution. Finally, if for some level j > k, r G [cj — 1, Cj + \ — 1), 
we can replace Ck by a new cluster which is rooted at the level- j ancestor of Vk and has a radius of 
Cj — 1. The new cluster covers all demand covered by Ck at no greater cost. 

In such a solution, each cluster C(vk, c& — 1) costs Ck and, by the construction of T, corresponds 
to a parking permit of type k that covers all the driving days corresponding to the demand points in 
the subtree rooted at Therefore, buying the parking permits corresponding to the clusters opened 
by a feasible solution of X' gives a feasible solution of X of equal cost. □ 



A.4 The Proof of LemmaQ] 

Given an instance X of OnlSumRad in an HST with K levels, we construct an instance X' of the 
interval version of ParkPermit with K permits, such that any feasible solution of X is mapped, in an 
online fashion, to a feasible solution of X' of equal cost, and vice versa. 

Let X be an instance of OnlSumRad in an a-HST T with K levels, where all nodes at level k, 
1 < k < K — 1, have the same number of children, and all demands are located at the leaves of 
T. Without loss of generality, we assume that the cluster opening cost is / = 1. The permits of X' 
essentially reflects the structure of T. Specifically, there is a day in the schedule of X' corresponding 
to each leaf of T. For each leaf vq, there is a permit of type with cost Co = 1 and duration do = 1. 
This permit covers the day corresponding to vq and is equivalent to a cluster C(vq,0) of cost 1. 
Similarly, for each node Vk at level k of T, 1 < k < K — 1, there is a permit of type k with cost 
Cfc = (a k + a — 2)/(a — 1) and duration dk = Ylj=i n j- This permit covers the days corresponding 
to the leaves of the subtree rooted at Vk and is equivalent to a cluster C(vk, {a k — 1 ) / (a — 1 ) ) of cost 
equal to q.. The permits of type k — 1 corresponding to the children of Vk in T are embedded in the 
permit of type k corresponding to Vk, in the sense that the intervals covered by the former permits form 
a partition of the interval covered by the latter. As for the demand sequence of X', for each demand of 
X located at a leaf vq of T, the day corresponding to vq in X' is marked as a driving dajQ. 

Next, we describe an online mapping of any feasible solution of X to a feasible solution of X' of 
equal cost. Similarly to the proof of Theorem [TJ we assume, without loss of generality, that in any 
feasible solution of X, any cluster centered at a level-/c node Vk has a radius of (a fc — l)/(a — 1). 
Then, each cluster C{vk, (a k — l)/(a — 1)) costs c^, and corresponds to a permit of type k that covers 
all driving days corresponding to leaves of the subtree rooted at Vk- Therefore, purchasing the permits 
corresponding to the clusters of a feasible solution of X' gives a feasible solution of X of equal cost. 

For the converse mapping, we observe that a permit of type k that covers the driving days in an 
interval Dk corresponds to a level-A; node Vk of T, in the sense that opening a cluster C(vk, Ck — 1), 
of cost Ck, covers all demand points corresponding to the driving days in Dk- Therefore, opening the 
clusters corresponding to the permits purchased by a feasible solution of X gives a feasible solution of 
X' of equal cost. □ 

4 We highlight that the leaves of T can appear in the demand sequence of X in any order. Thus, we require that the 
ParkPermit algorithm is time-sequence-independent, i.e., it can handle driving requests that arrive out of the time order. 
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A.5 The Proof of Theorem g] 

We first show that a constant-distortion planar embedding of a ternary a-HST T implies the theorem. 
Specifically, let a be any constant in (2, 3), and let D a be the distortion of an embedding e that maps 
each node v of T to a point e(v) in the plane. Namely, for every pair of nodes u, v of T, we have that 
dp(u, v)/D a < dp(e(u), e(v)) < dp(u, v), where dp(u, v) (resp. dp(u, v)) denotes the distance of u 
and v in T (resp. in the Euclidean plane). Assuming the embedding e and a c-competitive deterministic 
algorithm A for OnlSumRad on the plane, we describe a 2cD Q -competitive algorithm A' for T. 

For any demand point u in T, we present the algorithm A with a demand located at e(u). If A 
covers e(u) by opening a new cluster C(v,r), the algorithm A' opens a new cluster C(u,2D a r). 
Then, for every node z of T for which e(z) is covered by C(v, r), z is covered by the corresponding 
cluster C(u, 2D a r) of A'. This holds because dp(e(u), e(v)) < 2r and the distortion of e is D a . If 
e(u) is covered by an existing cluster of A, the previous observation implies that u is covered by the 
corresponding cluster of A'. 

Since for any demand points u, u', the distance of e(u) and e(u') in the plane is no greater than 
their distance in T, the optimal cost of the instance presented to A is no greater than the optimal cost 
of the instance presented to A'. Also, the cost of each cluster of A' is at most 2D a times the cost of 
the corresponding cluster of A. Therefore, the competitive ratio of A' is at most 2D a c. Since D a is 
a constant and, by Theorem |2j the competitive ratio of A' is J? (log re), the competitive ratio of A is 
J? (log re) as well. 

To conclude the proof, we describe a D a -distortion embedding of a ternary a-HST T with K + 1 
levels in the Euclidean plane. The root of T is mapped to the point (0, 0). The children of the root 
are mapped to the points (— a K , 0), (0, a K ), {a K , 0). For each level-fc node v^, k = K, . . . , 1, whose 
parent is located along the x-axis on the left (resp. on the right), its children are mapped to the 3 points 
at distance a k ~ l to located along the x-axis on the right (resp. on the left) and along the y-axis up 
and down. For each level-A; node Vk, k = K, . . . , 1, whose parent is located down along the y-axis, 
its children are mapped to the 3 points at distance a k ~ l to located up along the y-axis and left and 
right along the x-axis (see also Fig. 12). 

We proceed to show that the distortion of this embedding is at most y/2a/(a — 2). We first observe 
that for any two nodes u, v of T, dp(e(v),e(u)) < dp{u, v), i.e., the distance of u and v in T is no 
less than the distance of their images e(u) and e(v) in the plane. Moreover, due to the self-similarity 
of the embedding, the maximum distortion occurs for pairs of leaves of T mapped to points in the 
plane that lie at symmetric locations with respect to the line y = x (or to the line y = — x) and are 
closest to it (e.g., such are the pairs of leaves/points 21 and 24, 22 and 23, 31 and 32, and 30 and 33 in 
Fig. O. The distance of any such a pair of leaves u, v in T is dr(u, v) = 2(a K+1 — l)/(a — 1). On 
the other hand, the distance of their images e(u), e(v) in the Euclidean plane is: 




Therefore, the maximum distortion of the embedding is: 



D a = 



2{a K+1 - 1) V2a 



V2(a K + 1 - 2a K + 1) ~ a - 2 



where the inequality holds for all a G (2, 3). 



□ 
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Fig. 2. An example of the embedding used in the proof of Theorem [5] The nodes and the edges connecting them depict the 
structure of a ternary q-HST with 4 levels and a ~ 2.5. The locations of the nodes correspond to the locations in the plane 
to which they are mapped by the embedding. 

A.6 PD-SumRad: Feasibility of the Dual Solution 

We show that the dual solution maintained by PD-SumRad satisfies all the dual constraints. In the 
dual solution maintained by PD-SumRad, each variable ctj is either or /. Since the righthand-side of 
any constraint is a multiple of /, no constraint can be violated without first becoming tight. To prove 
the lemma, we show that after a constraint becomes tight, its lefthand-side does not increase, and thus 
the constraint will never be violated. 

We call a cluster C(z,rfc) tight if the dual constraint corresponding to (z,k) is satisfied with 
equality. We next prove that as soon as a cluster C(z,rk) becomes tight, each subsequent demand 
u G C(z,rk) is covered by some open cluster of PD-SumRad, and thus the corresponding dual 
variable is set to 0. To this end, let us consider some cluster C(z,rfc) that becomes tight when a 
demand Uj is processed. Then, d(uj,z) < r&. We assume that to cover Uj, PD-SumRad opens a 
new cluster C' = C(z',3rfc/). The algorithm ensures that k' > k (and thus > rk) and that 
d(uj, z') < ry. Now let u be any subsequent demand in C(z, r^). Since 



u is covered by C . The first inequality above holds because the metric space satisfies the triangle 
inequality; the second holds because both u and uj belong to C{z,rjS). Finally, the third inequality 



d(u, z') < d(u, Uj) + d(uj, z') < 2rfc + < 3r^/ 



follows from > r^. 



□ 
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A.7 The Proof of LemmaH 



We recall the assumption that the optimal solution only consists of clusters with radius of the form 
2 k f, where A; is a non-negative integer. To establish the competitive ratio, we consider an optimal 
cluster C(p, 2 k f) of total cost (2 k + 1)/, k < log 2 n, and bound the expected cost of the algorithm 
until it opens a cluster that includes the entire cluster C(p,2 k f). 

Let tti, tt2, • • • , ut be the sequence of demands, where the cluster that ut opens covers all of 
C(p, 2 k f). Notice that T itself is a random variable. For each demand u.j, we let Xi be the random 
variable for the cost of the clusters that m opens. Hence, the total algorithm's cost for u\, u%, ■ ■ ■ , ut 
is X = YlJ=i Xi- For each demand m, Xi is if Ui is covered upon aiTival. Otherwise, Xi follows the 
distribution in the description of Simple-SumRad. Let Y L be a new random variable such that Y$ = Xi 
if ui is not covered, else Yi takes a value as if Ui was not covered at its arrival time. Clearly, for each 
i, Xi < Thus, the expected cost of Simple-SumRad until a cluster covering all of C(p, 2 k f ) opens 
is: 



E 



< E 



' T 

£ 

,i=i 



Yi 



We observe that Yi are nonnegative, independent and identically distributed random variables, and 
that T is a stopping time. Hence, by Wald's equation we have that E[J^^ =1 Yi] = K[Y] • E[T], where 
Y denotes the (identical) distribution of Yi, . . . , Yp. 

E[T] denotes the expected number of demands in C(p, 2 k f) that have arrived before the first of 
them opens a new cluster of radius 2 k+1 f that includes the entire cluster C(p, 2 k f). Hence, E[T] = 
2 k+1 . Moreover, we have that: 

log 2 n 



E M= E ^(2 i + l)/<(3 + log 2 n)/ 



Taking also into account the cost of (2 fc+1 + 1)/ for the cluster of radius 2 k+1 f opened by ut, the 
expected cost of the algorithm for the demands in C(p, 2 k f) is at most (2 fc+1 (3+log 2 n) + 2 fc+1 + l)/, 
which is at most 2(4 + log 2 n) times the optimal cost for C(p, 2 k f). □ 



A.8 The Proof of LemmaH 

We consider an instance X of OnlSumRad in the <i-dimensional Euclidean space. We assume, without 
loss of generality, that the cluster opening cost is / = 1 and that the demands lie in a bounded d- 
dimensional hypercube of side length A. Moreover, we assume that the number of demands n is an 
integral power of 2 and given in advance. Given I, we construct an instance 1' of OnlSumRad in a 2- 
HST T with 0(log n) levels, where all nodes at the same level have the same number of children, and 
all n demands are located at the leaves of T. The construction ensures that (a) any feasible solution 
of X of cost c is mapped to a feasible solution of X' of cost at most 0(2 d c); and (b) any feasible 
solution of X' of cost d is mapped, in an online fashion, to a feasible solution of X of cost at most 
Vdd. Consequently, given a c(K) -competitive algorithm for X', we can apply this transformation and 
obtain a 0(2 c( v / dc(i 1 C))-competitive algorithm for X, where K = 0(log n). 

The 2-HST T that determines the metric space of X' corresponds to a hierarchical decomposition 
V of the ci-dimensional space into hypercubes with K + 1 levels, where K = log 2 n. More precisely, 
the highest level K of the decomposition partitions the space into [Z\/n] d level- K d-dimensional 
hypercubes, each with a side length of 2 K = n. Each level-fc hypercube, k = K, . . . , 1, has a side 
length of 2 k , and is partitioned into 2 d level-(/c — 1) children hypercubes of side length 2 fc_1 . 
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The root of the 2-HST T (at level K + 1) has as many children as the number of level- A" hy- 
percubes. The distance between the root and the children is set to some arbitrarily large number (to 
comply with the definition of HSTs, one may replace the edges from the root to its children by appro- 
priately long paths, where the edge length drops by a factor of 2 on each level). There is an HST node 
Vk at level k, k = K , . . . , 1, corresponding to each level-A; hypercube in V. Each level-/c node Vk 
has 2 d children in T that correspond to the 2 d children of in V. The distance in T between each 
level-fc node to its children is 2 fe_1 . The level-0 nodes of T are leaves and do not have any children. 
The cluster opening cost is / = 1. For each demand uj in X, there is a demand Uj located at the leaf 
of T corresponding to the level-0 hypercube that includes uj in the hierarchical decomposition V. 

To establish property (a) above, we consider a feasible solution of X, and let C(v,2 k ~ 1 ), with 
1 < k < K + 1, be any of its clusters. We observe that C(v,2 k ~ 1 ) intersects at most 2 d level-fc 
hypercubes h\,...,hJt of the hierarchical decomposition V. Hence, all demands in T that correspond 
to the demands in X covered C (v, 2 k ~ 1 ) can be covered by 2 d clusters with cost 2 k rooted at the level- 
k nodes of T that correspond to h\, . . . , h\ . The total cost of these clusters in T is 2 d 2 k , i.e., at most 
2 d+l times the total cost of C(v, 2 k ~ 1 ). Therefore, any feasible solution of X of cost c can be mapped 
to a feasible solution of X' of cost at most 0(2 d c). 

As for property (b), we consider any feasible solution of X' with total cost c. As in the proof of 
Theorem CD we assume, without loss of generality, that all clusters are centered at nodes at levels 
1, . . . , K, and that any cluster centered at a level-fc node Vk has a radius of 2 k — 1. Let C(vk, 2 k — 1) 
be any cluster centered at a level-A; node of T. Then, the demands in X that correspond to the 
demands covered by C(vk, 2 k — 1) in X' belong to a single level-A; hypercube of the hierarchical 
decomposition V. Hence, all these demands can be covered by a single cluster with radius \fd ■ 2 k ~ 1 
centered at the center of h^. Therefore, given a feasible solution S of X' of cost c, opening the clusters 
in the d-dimensional Euclidean space that correspond to the clusters of S, we obtain a feasible solution 
of X of cost at most \fd c. □ 

A.9 Frac-SumRad: Estimating the Number of Demands 

To remove the assumption that n is known to Frac-SumRad in advance, we run Frac-SumRad in 

phases, where each phase £ uses an estimation ri£ = 2 2 of n. Phase I, where I = 1,2,..., ends just 
after its processing of ri£ demands. Then, the algorithm keeps the fractional solution for the demands 
arriving in phase i, and starts computing a new fractional solution for the next demands arriving in 
phase £+1, with an estimation n^ + i of n. 

We show that running Frac-SumRad in phases increases its competitive ratio by no more than a 
constant factor. Let A be the last phase of Frac-SumRad. By Theorem |6j the cost of Frac-SumRad in 
phase I, £ = 1, . . . , A, is at most 2^/30PT^, where OPT^ is the optimal cost for the demands arriving 
in phase £, and /3 is the constant hidden in the O-notation, in Theorem|6] Since the optimal cost OPT 
for all demands is no less than OPT^ , the total cost of Frac-SumRad is at most 2 A+1 /30PT. On the 

2 A-1 

other hand, the total number of demands is at least 2 , because the phase A — 1 is complete, 
and log log n > 2 A_1 . Therefore, the total cost of Frac-SumRad is at most 4/32 A ~ 1 OPT, and the 
competitive ratio is 0(log log n). 
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